Saying What You're Looking For: Linguistics Meets Video Search

Authors

  • Daniel Paul Barrett
  • Andrei Barbu
  • N. Siddharth
  • Jeffrey Mark Siskind
Abstract

We present an approach to searching large video corpora for clips that depict a natural-language query in the form of a sentence. Compositional semantics is used to encode subtle differences in meaning that other approaches lose, such as the difference between two sentences that contain identical words but have entirely different meanings: "The person rode the horse" versus "The horse rode the person". Given a sentential query and a natural-language parser, we produce, for each clip in a corpus, a score indicating how well the clip depicts that sentence, and we return a ranked list of clips. Two fundamental problems are addressed simultaneously: detecting and tracking objects, and recognizing whether those tracks depict the query. Because both tracking and object detection are unreliable, our approach uses the sentential query to focus the tracker on the relevant participants and ensures that the resulting tracks are described by the sentential query. While most earlier work was limited to single-word queries corresponding to either verbs or nouns, we search for complex queries that contain multiple phrases, such as prepositional phrases, and modifiers, such as adverbs. We demonstrate this approach by searching for 2,627 naturally elicited sentential queries in 10 Hollywood movies.
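
The contrast drawn in the abstract between word-level and compositional matching can be illustrated with a small sketch. This is not the authors' method: their system tracks objects and scores tracks jointly, which is not attempted here. The function names (`bag_of_words`, `toy_roles`, `rank_clips`), the clip identifiers, and the per-clip role annotations are all hypothetical, and `toy_roles` is only a stand-in for a real natural-language parser that handles sentences of the form "The X VERB the Y".

```python
from collections import Counter
from typing import Dict, List, Tuple

Roles = Tuple[str, str, str]  # (agent, verb, patient)


def bag_of_words(sentence: str) -> Counter:
    """Order-insensitive representation: word counts only."""
    return Counter(sentence.lower().rstrip(".").split())


def toy_roles(sentence: str) -> Roles:
    """Hypothetical stand-in for a parser; handles only 'The X VERB the Y'."""
    words = sentence.lower().rstrip(".").split()
    if len(words) != 5 or words[0] != "the" or words[3] != "the":
        raise ValueError("toy parser only handles 'The X VERB the Y'")
    return (words[1], words[2], words[4])


def rank_clips(query: str, annotations: Dict[str, Roles]) -> List[Tuple[str, float]]:
    """Score every clip against the query and return clips sorted best-first.

    The score is an assumption made for illustration: 1.0 when the clip's
    annotated roles equal the query's roles, 0.0 otherwise.
    """
    wanted = toy_roles(query)
    scored = [(clip, 1.0 if roles == wanted else 0.0)
              for clip, roles in annotations.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


a = "The person rode the horse"
b = "The horse rode the person"

# Identical words, so a word-count representation cannot tell them apart.
same_words = bag_of_words(a) == bag_of_words(b)

# Different role assignments, so a structured representation can.
same_roles = toy_roles(a) == toy_roles(b)

# Hypothetical annotations standing in for what a video analysis might yield.
annotations = {
    "clip_b": ("horse", "rode", "person"),
    "clip_a": ("person", "rode", "horse"),
}
ranking = rank_clips(a, annotations)
```

Here `same_words` is true while `same_roles` is false, and `ranking` places `clip_a` first for query `a`. The point of the sketch is only that a representation preserving who did what to whom separates the two sentences, whereas a representation based on the words alone does not.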


Similar resources

in Foundations of Speech Act Theory

Like Humpty Dumpty, many philosophers take pride in saying what they mean and meaning what they say. Literalism does have its virtues, like when you're drawing up a contract or programming a computer, but generally we prefer to speak loosely and leave a lot to inference. Language works far more efficiently that way. Two Kinds of Looseness It helps if you can rely on people not to take you too l...


What We Are Talking about and What We Are Saying about It

In view of the relationships between theoretical, computational and corpus linguistics, their mutual contributions are discussed and illustrated on the issue of the aspect of language related to the information structure of the sentence, distinguishing "what we are talking about" and "what we are saying about it".


The Structure of Memory Meets Memory for Structure in Linguistic Cognition

Title of dissertation: THE STRUCTURE OF MEMORY MEETS MEMORY FOR STRUCTURE IN LINGUISTIC COGNITION. Matthew Webb Wagers, Doctor of Philosophy, 2008. Dissertation directed by: Professor Colin Phillips, Department of Linguistics. This dissertation is concerned with the problem of how structured linguistic representations interact with the architecture of human memory. Much recent work has attempted to ...


Just Because You're Offended Doesn't Mean You're In The Right: A Perspective on Language, Comedy, and Ethics

Some humor is offensive, but does this convey a moral constraint on what comedians can include in their jokes? Using stand-up bits and reflections on comedy from George Carlin, Louis C.K., and Doug Stanhope, various philosophies of humor, and the linguistic philosophy of H.P. Grice, I explore the given question and attempt to settle the disputes about when it is prudent to be offended, in what ...


The Abstractness of Artworks and its Implications for Aesthetics

What would it be like to belong to a profession whose members routinely believe many more than six impossible things before breakfast? Well, just look in the looking glass: the chances are that if you're reading this, you are already associated with the profession in question. Such are the implications of the argument to follow. We aestheticians, and our theories, are currently embroiled in all...



Journal title:
  • IEEE Transactions on Pattern Analysis and Machine Intelligence

Volume 38, Issue 10

Pages: -

Publication date: 2016